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Abstract —Two types of privacy-preserving decision making 
paradigms for utility-community interactions for multi-horizon 
operation are examined in this paper. In both designs, communi¬ 
ties with renewable energy sources, distributed generators, and 
energy storage systems minimize their costs with limited informa¬ 
tion exchange with the utility. The utility makes decision based 
on the information provided from the communities. Through 
an iterative process, all parties achieve agreement. The authors’ 
previous research results on subgradient and lower-upper-bound 
switching (LUBS)-based distributed optimization oriented multi¬ 
agent control strategies are examined and the convergence anal¬ 
ysis of both strategies are provided. The corresponding decision 
making architectures, including information flow among agents 
and learning (or iteration) procedure, are developed for multi¬ 
horizon decision making scenarios. Numerical results illustrate 
the decision making procedures and demonstrate their feasibility 
of practical implementation. The two decision making architec¬ 
tures are compared for their implementation requirements as 
well as performance. 

Index Terms —AC OFF, dual decomposition, moving horizon 
optimization, convergence property, spin reserve 


1. Introduction 

S IGNIFICANT increase in penetration of private agents 
having their own microgrids to the power network is 
highly expected in the future smart grid, which makes control 
and optimization of the power network more challenging 
than before. These microgrids may have different types of 
distributed energy resources (DER) such as renewable en¬ 
ergy systems (RES) and energy storage systems (ESS) to 
mitigate renewable energy intermittency. Storage systems are 
constrained by temporal limitations which makes decision 
making to consider multiple time horizons instead of just one 
snapshot. In addition, the private agents have their own specific 
interests and do not like to share their private information 
with the system operator due to economical or privacy-related 
reasons. Erom the operator’s point of view, it is desired to 
operate the entire power network including the microgrids 
in its optimal operation. In such optimization problems, the 
network constraints such as network congestion and voltage 
limits must be considered. Eurthermore, the spinning reserve 
is required, which guarantees the system be ready to respond 
to contingencies that affect power balance. 

V.R. Disfani, Z. Miao, and L. Fan are with the Department of Electrical 
Engineering, University of South Florida, Tampa, FL 33620 (Email: lingling- 
fan@usf.edu). B. Zeng is with the Department of Industrial Management and 
Science at University of South Florida. 


The objective of this paper is to develop privacy-preserving 
multi-horizon utility-community decision making paradigms 
considering AC power flow, network constraints, and reserve 
requirements. 

The mathematic foundation of privacy-preserving decision 
making is distributed optimization. In the literature, there are 
several approaches to develop distributed algorithms for both 
AC OPE Q-«5I and DC OPE 0-@ These algorithms de¬ 
compose an optimization problem to smaller subproblems and 
design an iterative process to seek the optimal solution, where 
one agent is responsible to each one of these subproblems. 
After the agents solve their own OPE problem (e.g, using using 
interior point method), these algorithms performs an update 
process to push the solution toward the optimal solution. The 
update process requires agents to share information. Therefore, 
each algorithm corresponds to an information fiow structure. 

A modified subgradient-based method has been proposed in 
the authors’ previous work j^, where microgrids and a utility 
name their desired levels of power transactions and a price 
updating scheme pushes the solution toward the optimal solu¬ 
tion. Another method proposed in 0 is named lower-upper- 
bound-witching (LUBS), where the microgrids update their 
desired prices based on the power demand of the main grid. 
The main grid defines its optimal power export/import level 
from each microgrid based on the prices. The convergence of 
the upper-bound and the lower-bound indicates the optimality 
of the solution. 

To handle the time-correlated constraints of ESS, OPE need 
to consider multiple time horizons instead of just one snapshot. 
Several approaches has been addressed in the literature to 
tackle multi-horizon OPE problem. Bender’s decomposition 
method is developed in fTQ| to tackle energy constraints of 
hydroelectric plants integrated in irrigation systems. Bender’ 
decomposition focuses on decomposition of integer decision 
variables and continuous decision variables and cannot be 
translated into a multi-agent control structure. Another thread 
of research GD (m develops a KKT-based solution to con¬ 
sider battery storage systems. An optimization model is also 
proposed in GD for multi-horizon OPE with wind generation 
and battery storage. The capability of ESS to provide ancillary 
services such as reserve is taken into account to tackle reserve 
constraints in an OPE problem without considering their time- 
correlated constraints GD GD 

To the authors’ best knowledge, the proposed research 
privacy-preserving decision making for multi-horizons while 
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considering AC OPF and spinning reserve, has not been seen 
in the literature. This paper extends the research in Q to ap¬ 
ply the privacy-preserving decision making to multi-horizons 
while considering spinning reserve requirements. Spinning 
reserve adequacy is a constraint which is considered not only 
in unit commitment problem fT^ but also in OPF problem 
|T7|, fT^ . A reserve-constrained OPF problem is developed 
in |17| to incorporate the expected security costs in the system 
using reserve marginal value (reserve price) which indicates 
how much cost is imposed to the system operation if one 
more unit of reserve is needed. The extension of MATPOWER 
4.1 can solve OPF with co-optimization of energy and 
reserve. This capability is employed in our research to solve 
reserve-constrained OPF problems. 

The contribution of this paper has threefold. 

• Two privacy-preserving decision making paradigms based 
on our previous research are developed for multi-horizons 
while considering reserve constraints; 

• Convergence properties for the paradigms are analyzed 
and convergence enhancement measures are proposed and 
explained. 

• Realistic implementation of the decision making 
paradigms at real time is investigated using moving 
horizon optimization technique and demonstrated in case 
studies. 

The rest of the paper is organize as following: the two 
dual decomposition-based paradigms for a multi-horizon OPF 
problem are presented in Section II. Implementation for real¬ 
time operation is investigated and moving horizon optimiza¬ 
tion technique is adopted. The algorithms’ convergence prop¬ 
erties are investigated in Section III. Section VV presents a 
case study to demonstrate the decision making process. The 
conclusion is presented in Section V. 


II. Dual Decomposition-Based Privacy-Preserving 
Decision Making Paradigms 


Consider a power network consisting of a set JV of buses 
and a set £ of branches. The utility is responsible to operate 
the power grid, its generation units and transactions with trans¬ 
mission systems. As shown in Fig. community microgrids 
may have energy storage systems, renewable energy resources, 
and internal loads. These microgrids are connected to the 
network and behave as private agents who will share only 
limited information with the utility. The set A includes all 
buses which the communities are connected to. 

The reserve-constrained 24-hour OPF problem considering 



Fig. I. A community microgrid including a generator, a renewable energy 
system (RES), an energy storage system (ESS) and internal load. 


community microgrids is defined as below: 
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where C{’) is the cost function, superscripts M and m denote 
upper and low limits. Subscripts i ^ J\f and t G T refer to the 
variables corresponding to bus i and hour t where the set T = 
{t G Z|1 < t < 24} denotes the time horizon. Pg, Qg, Pl and 
Ql are the vectors of bus real and reactive power generations, 
and real and reactive loads. Pi) and Ppv denote the battery 
charging power and PV output power respectively. P{V,0) and 
Q{V,0) are the active and reactive power injections in terms 
of bus voltage magnitude and phase angles, and S{V,0) is the 
vector of line complex power fiow. Ei) also determines the 
energy stored in the storage systems. The parameters Rg and 
Ri) denote the reserve provided by the generators and battery 
systems respectively. 

In the optimization problem Q, the objective function ( p^ 
is defined so as to minimize the total generation cost in the 
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entire system. Active and reactive power balance constraints 
are defined in and © for all buses at all hours. The 
dependency between battery charging power and energy stored 
in the battery is stated in Q- The constraints 
describe minimum and maximum limits of the corresponding 
variables. The constraint ( p^ mandates that the ultimate level 
of energy stored in the battery to be equal to that at starting 
point. The minimum level of total reserve from both utility 
generators and the community energy sources is guaranteed 
by ([nil while the constraints ( |lm| )-(p^ take care of the reserve 
constraints. 

Let Timpi,t denotes the utility’s power import from commu¬ 
nity connected to bus i G ^ at time t while Pexpi ^ denotes 
the same community’s power export to the utility. Therefore, 
the power balance equations at Bus i will be replaced by the 
following four equations. 

Pimp,., 

^expi,t ~ ^gi,t ~ ^Li^t ^PVi^t ~ 

Qimpi,t = Qi,t{V, 0 ) 

Qexpi,t ~ Qgi,t ~ Qpi^t QpVi^t ~ Qbi,t 
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A community now only relates to Pexpi,t’ i^^t Pimpi,f 
Global constraints that relate the communities to the utility 
are imposed as 

• ^impi,t ~ Pexpi,t 5 

• Qimpi,t ~ Qexpi,t 5 

• Rdt ~ ^gi,t ~ ^j,t ^ 0- 

ieAf-A jeA 

with the corresponding Lagrangian multipliers also notated. 
In the reserve constraint, we have separated reserve provided 
by the utility generators ^ and the reserve provided by 
a community j Note there are multiple real power and 
reactive power prices at t. However there is one reserve price 
at t. 

Applying Lagrangian relaxation to relax the three global 
constraints in ([^, the main problem is divided into several 
subproblems each of which is solved by the utility or one of 
the communities. The subproblem assigned to the utility is: 


Fig. gives an illustration to show the decomposed systems 
where the utility will treat every community as a generator 
with prices of energy and reserve given and the community 
will treat the main grid as a controllable load with prices of 
energy and reserve also given. 



Fig. 2. Power networks in utility and community optimization problems. 


A. Subgradient-Based Method 


24 

min Y. H 'pi- t-Pinip^. j + Sj,tQ^rnpj 

t=l ieJ^-A jeA 

(4) 

over Pgi_t,Qgi_t 

5-Tinipi 15 i^impi^t 5 
S. i.PZ<P9.,<PA, 

QA < Qg,. < Ql,, 

Ri,t < Ri,t < PA - Pg... ( 6 ) 

Rid ^ O5 ^impi,t ^ 0* 


The community i solves the following problem at each hour 


In subgradient method, the energy price and reserve price 
signals are first specified by the price update center (PUC) 
as illustrated in Fig. The price signals are used by the 
utility and communities to define their power import and 
export levels as well as reserve for all t e T. For the next 
iteration, the energy price values Xi^t foi* the next iteration 
are then updated using the mismatch between corresponding 
power import and export levels. The reserve price value /it 
is also updated to reflect the mismatch of import and export 
level. Fig. also depicts the information flow between these 
three blocks through the algorithm. 

1) Price Updating: Updating the energy prices at each bus 
and the reserve price are the functions assigned to Price Update 
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Fig. 3. Information flow of multi-horizon subgradient algorithm. In this 
flgure, information related to reactive power is not notated. 



Center, utilizing the following equations: 

XI, (k + 1) = XI, ik) + ai,t{k) ■ (Pi^p,, - Pe.p,J (8) 

+ 1) ~ • (Qimpi,t ~ Qexpi,t) (9) 

fit{k + 1) = + Pt{k) • {Rdt ~ ^ ^ ^9i,t ~ ^ V ^j,t) 

ieM-A jeA 

( 10 ) 

where k is the index of updating step, ai^t{k) and Pt{k) are 
positive coefficients. 

2) Community Optimization: In order to define its power 
export level at different hours, each community must perform 
its optimization based on the given price from the PUC. In 
this paper, it is assumed that each community has a generator 
and a defined power load as well as a PV-battery package. 
The optimization problem corresponding to the community 
connected to bus i G ^ is presented in Q. 

Each community receives the updated price for energy and 
reserve, solves its own subproblem 0 and determines the 
amount of power to sell or to export to the main grid (Pexpi,t 
for all t G T) as well as the level of reserve it may provide. 

3) Utility Optimization: For each hour t, the utility updates 
the import level. In this paper, MATPOWER 4.1 | [T9| is 
employed to solve the reserve-constrained OPE problem where 
the the communities are assumed as traders selling their power 
by the price announced by PUC. These values create the power 
import matrix Pimp from different communities at different 
hours as well as the reserve vector P, which are sent to the 
PUC for price updating process for the next iteration. 

B. LUBS Method 

In this structure, communities are responsible to determine 
the optimal price vectors at different hours to be fed into the 
utility’s OPE problems. At each iteration, communities also 
update utility about the maximum and minimum power and 
reserve they can provide. The information fiow structure based 
on LUBS algorithm is illustrated in Fig. 

1) Community Optimization and Price Updating: Once the 
communities receive the utility’s power import information for 
24 hours, they must determine their local generator and battery 
dispatch four 24 hours and announce the corresponding price 
signals back to the utility. 


Fig. 4. Information flow of multi-horizon LUBS algorithm. Information 
related to reactive power is not indicated in the flgure. 

In general, the community connected to the bus i ^ A de¬ 
termines its Lagrange multipliers corresponding to the power 
balance constraints (A^^, A^^). When the generator limits are 
not binding, the Lagrangian multipliers or the prices are same 
as the marginal prices of the generators. 

One of the requirements in LUBS method is that the solution 
sought by the utility for power and reserve must be feasible 
for communities. If Community i’s marginal cost is cheaper 
than the minimum marginal cost of the utility and other 
communities, the utility will demand its maximum generation 
and battery power at all hours. This will be infeasible for 
the battery due to the energy constraints of the battery. 
Communities are then required to update the maximum and 
minimum power limits at each iteration in order to prevent 
such issues. 

Community i also updates the maximum reserve it can 
provide at each iteration according to its dispatched battery 
power as below and sends it to utility, 

i?“(fc + i) = i?^-p,7 + n,.{fc) 

2) Utility Optimization: At iteration k, the utility solves its 
own subproblem given price values Xi^ik) as well as power 
limits to determine the power import vector Pimp for 24 hours 
while considering maximum and minimum limits of power 
and reserve which are announced by communities. The power 
import information is then announced to the communities. The 
procedure to solve utility OPE problem is the same as that in 
the subgradient method. The utility also computes the utility 
cost Cu{k) at k-\h iteration. 

C. Implementation and Moving Horizon Optimization 

We assume a 24-hour PV profile and a 24-hour load profile 
based on forecast. For the current hour, the load and PV 
outputs are known and the future 24 hours are predicted. 
The decision making procedures described above are then 
carried out for 24-hour iteratively. For real-time operation 
implementation, each iteration will be carried out for a time 
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Fig. 5. Time schedule for moving-horizon problems solving. 


period, say two minutes. Every two minutes, the utility and the 
communities exchange information and update their decision 
variables. If an algorithm can converge in ten steps, the 
optimal operating condition can be realized in 20 minutes. 
This “learning process” is achieved by continuous information 
exchange and updating. 

Suppose at Hour k, the learning takes 20 minutes and for the 
rest of the 40 minutes, both the communities and the utilities 
are keeping the process however the generator power output 
and the battery power output will be constant. Moving to Hour 
/c + 1, the load profile now has a change. The communities 
and the utility need to find the optimal battery dispatch and 
generator dispatch again through “learning.” 

The load profile and the PV profile at Hour k-\-l at real-time 
are different from the prediction made at Hour k. Therefore, 
the load and PV profiles should be adjusted to refiect the /c +1 
hour information. Prediction for the future 24 hours needs to 
be made again. Another 24-hour optimization problem has to 
be solved iteratively. 

The above mentioned procedure is termed as moving hori¬ 
zon optimization. To take the underlying power system dy¬ 
namics into consideration, the power command decided by the 
decision making layer should better be continuous. Therefore, 
instead of assuming a same initial price for all time horizons 
(e.g, %^/MWh), it is more desirable to start the next horizon 
“learning” process with the information from the previous time 
horizon. 


HI. Convergence Analysis 

In this section, convergence properties of the two iterative 
methods are examined. For simplicity, the analysis is limited 
to energy price and real power only. 


A. Subgradient Method 

In this method, the price is announced to both the 
utility and a community at iteration k, then they decide how 
much power they would like to exchange with each other. 
Regarding the power level of both sides, the price update 
sector updates the price for the next iteration. If the slopes of 
marginal cost functions corresponding to utility and microgrid 
are respectively ai > 0 and a 2 > 0. Then the import and 


export power can be expressed as: 


pk 

= p° - 

2a* 

-^imp 

imp 


pk 

= p° + 


■^exp 

exp ' 

02 


( 11 ) 

( 12 ) 


( pTj ) also indicates the economic behavior of the utility. The 
cheaper the price, the more power will be purchased. On the 
other hand, the greater the selling price, the more power will 
be sent out by the community as shown in ^V2\ . 

Then we can obtain the iteration of the price. 




l-a[— + — 

ai 02 


A"+«(Tmp-nxp) (13) 


The iteration problems can be viewed as discrete domain 
dynamic problems. Hence, the convergence of the algorithm 
is the same as the stability of the corresponding discrete 
dynamic system. 1 — a 


f— + —) 

\ai a2 J 


should be less than 1. 
Therefore, when a is very small, convergence is guaranteed. 
The critical value of a which is notated as acr = • When 

a < acr, subgradient method converges. When a > acr, the 
algorithm will not converge. For an a equal to critical value, 
the iterations will circulate in a loop. Fig. illustrates the 
possible regions for A^+^ in a case where > Pgxp which 
is divided by the line A^+^ = A^ -h Q(cr(^mp “ ^exp) to 
converging and diverging areas. 


Price ($/MWh) 



Eig. 6. Graphical presentation of different regions of subgradient method 
convergence for various values of a. The algorithm converges if a < acr, 
diverges if a > acr, and circulate in a loop if a = acr- 



































6 


B. LUBS Method 

In this method, a community sets price based on the power 
demand by the utility. The utility then sets its import power 
based on the price. Therefore, the iterative procedure can be 
explained as follows. 

pk ^ pO _ (14) 

imp imp 

Gj\ 

PLp = ^xp + (15) 

CL2 

•El indicates that the utility sets its k-\h step power based on 
a price . indicates that the community updates its price 
based on the power demand from the utility. 

•El and EHf lead to the following iteration. 

Afe+i = + a2(i^V - Pl^) (16) 

til 

The convergence criterion becomes 02 < ui, or the slope of 
marginal cost of a community should be less than that of 
a utility. This condition is very restrictive. The algorithm is 
modified to achieve better convergence. 

The scenarios are shown graphically in Fig. |7] where LUBS 
diverges when the community has more convex generation cost 
than microgrid does, i.e., 02 > ai (Fig|7]-a) and it converges 
otherwise when 02 < ai (Fig [Tj-b). The case of ai = a 2 is 
also considered as non-converging situation since the solutions 
will be trapped in a loop. _ 

The price computed in ( p^ is treated as a prediction (A/^+i). 
Based on this prediction, lOOcr percentage of the update will 
be made. The /c + 1 step price will be: 


Fig. shows (7 modification makes the algorithm converge 
to the optimal point for the case where ai < 02 (Fig|7}a). 



Fig. 8. Graphical presentation of LUBS convergence for different cases, a) 
LUBS diverges when agent Us generation cost is less convex than generator 
2’s, b) LUBS converges if agent I’s generation cost is more convex than 
generator 2’s. 


IV. Case Study 

The proposed two decision making paradigms have been 
implemented in a case study of a radial 42-bus test feeder 
introduced in IEEE Standard 399 p0| . One community is 
connected on Bus 50. The generator cost functions, power 
and reserve limits are described in Table U 

TABLE I 

Parameters and cost eunctions oe generators in IEEE Std. 399 

TEST EEEDER C(Pg) = 0.5aP^ + /3Pg + 7 


^/c+1 — ^/c+1 + Cr(A/c7i — A/e) (17) 

= (1 _ + a (^a 2 (P°„p - Pexp) - ^A'^) (18) 

= (1 _ A'' + aa2{P°^p - Pexp) (19) 


The convergence criteria is then identified as whether 


^ ^ ai+a2 

ai 


< 1 or not. The critical value of a can be ex¬ 


plicitly derived as tJcr = LUBS algorithm converges 

if cr < (Jcr, diverges when a > acr- Iteration will betrapped 
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pM 
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2 
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liGl 
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5 

0 
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42 

0 

29 

fiG2 

0 

3 

0 
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35 

0 

30 

fiGS 

0 

4 

0 

0.2 

38 

0 

50 

liGA 

0 

11 

00 

bo 

0.2 

49 
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The temporal demand profile of the entire test feeder is 
obtained by multiplied by a time-variant scaling factor p{t) as 
shown in Eig. 


in a loop if a = acr- 




Fig. 9. Demand scaling factor in 24-hour horizon, applied to obtain power 
demand profile. 

The utility optimization problem for 24 hours is solved in 
MATPOWER for 24 snapshots. The community optimization 


Fig. 7. Graphical presentation of LUBS convergence for different cases, a) 
LUBS diverges when the slope of the utility’s marginal cost is less than that 
of the community’s, b) LUBS converges if the slope of the utility’s marginal 
cost is greater than that of the community’s. 


problem is carried out by CVX ED Simulation results 
are presented in the following figures Eigs. pBp~5| The 24- 
hour profiles or 8-hour profiles clearly demonstrate how the 
“learning” processes took place. 
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Time (hour) 

Fig. 10. Comparison of community nodal price behavior through iterations in 
LUBS and subgradient for 24 consecutive moving-horizon simulations, where 
each iteration is assumed to take two minutes, a) The price in LUBS method 
throughout an entire day, b) the price in subgradient method throughout an 
entire day, c) a closer look on the price in both methods during a two-hour 
horizon. 



Fig. 11. A utility generator’s power output profile. 
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Fig. 13. 8-hour importing/exporting power profiles. 
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Fig. 12. The community’s generator and battery prohle. 


Fig. 14. 8-hour reserve price prohle. 

Remarks: the comparison of LUBS and subgradient-based 
method shows that for real-world implementation, LUBS is 
more favorable due to the smoother change in the price signals 
and power levels. 

V. Conclusion 

Dual decomposition-based privacy preserving decision mak¬ 
ing paradigms for utility-community interaction for multiple 
horizons are developed. The developed paradigms enable 
utility and communities to make their own decisions based on 
limited information exchange. The paradigms are suitable for 
radial connected utility and communities. Implementation for 
real-time operation scenarios takes care of forecast and real¬ 
time discrepancy by using moving horizon optimization tech¬ 
nique and by initiating the next horizon variables based on the 
information from the previous horizon. The information flow 
architectures are explained and the convergence properties 























































































































































































































































Fig. 15. 8-hour reserve profile. 

are investigated. The proposed decision making architectures 
were tested by a case study and demonstrate the “learning” 
processes taken by a utility and a community. 
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